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Abstract 

The correlation matrix formalism is used to study temporal aspects of the stock 
market evolution. This formalism allows to decompose the financial dynamics into 
noise as well as into some coherent repeatable intraday structures. The present study 
is based on the hiyh- frequency Deutsche Aktienindex (DAX) data over the time period 
between November 1997 and September 1999, and makes use of both, the correspond- 
ing returns as well as volatility variations. One principal conclusion is that a bulk of 
the stock market dynamics is governed by the uncorrelated noise-like processes. There 
exists however a small number of components of coherent short term repeatable struc- 
tures in fluctuations that may generate some memory effects seen in the standard 
autocorrelation function analysis. Laws that govern fluctuations associated with those 
various components are different, which indicates an extremely complex character of 
the financial fluctuations. 

1 Introduction 

One of the great challenges of econophysics is to properly quantify and, follow- 
ing this, to explain the nature of financial correlations and fluctuations. The 
efficient market hypothesis |TJ implies that they are dominated by noise. In- 
deed, the spectrum of the correlation matrix accounting for correlations among 
the stock market companies agrees very well pi |]] with the universal predic- 
tions of random matrix theory ||, |j. Locations of extreme eigenvalues differ 
however from these predictions and thus identify certain system-specific, non- 
random properties such as collectivity. In addition, these former properties 
turn out ||], [?J to depend on time reflecting a competitive character of the 
financial dynamics. 
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Figure 1: The Deutsche Aktienindex (DAX) in the calendar period November 
28, 1997 - September 17, 1999. 



The character of the financial time correlations is however very complex, 
still poorly understood and many related issues remain puzzling. The auto- 
correlation function of the financial time series, for instance, drops down to 
zero within few minutes which is interpreted as a time horizon of the market 
inefficiency ||, At the same time, however, the correlations in volatility are 
significantly positive over the time intervals longer by many orders of mag- 
nitude. The fat-tailed return distributions seem to be not Levy stable [[| on 
short time scales, but on longer time scales it appears difficult to identify 
their convergence to a Gaussian as expected from the central limit theorem. 
In addressing this sort of issues below we use the concept of the correlation 
matrix whose entries are constructed from the time series of price changes rep- 
resenting the consecutive trading days. The method focuses then entirely on 
the time correlations and their potential existence can parallelly be detected 
on various time scales. Analogous methodology has already been successfully 
applied [10| to extract from noise some repeatable structures in the brain sen- 
sory response, and its somewhat similar variant, the correlation matrix of the 
delay matrix, to study the business cycles of economics [11]. The present 
study is an extension of our recent work [12] and is based on an example of 
high-frequency (15 s) recordings [13]. As it can be seen from Fig. [I], this is 
an interesting period which comprises the whole richness of the stock market 
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dynamics like strong increases and decreases, and even a clearly identifiable 
hierarchy of the log-periodic structures 14]. 



2 Definiton of correlations 

In the present application the entries of the correlation matrix are constructed 
from the time-series g a (U) °f normalized price returns representing the consec- 
utive trading days labelled by a. Starting from the original price time-series 
x a (t) these are defined as 

G a (ti) - (G a (ti)) 

with 



Qaik) = ~ a "" %> ff '" aw,t , a{G a ) = ^{Gl{t)) t - <<?«(*)>? , (1) 
a(G a ) v 



G a (U) = lnx a ( tl + r) - lnx a (U) ~ X "& + T ) x °>&) , (2) 

•Ea (Pi ) 

where r is the time-lag and (. . .)t denotes averaging over time. 

The result is N time series g a {tj) of length T (the number of records during 

the day) i.e. an N xT matrix M. The correlation matrix can then be defined 

as 

C = (1/T) MM T . (3) 

Its entries C Qj( y are thus labelled by the pairs of different days. By diagonal- 
izing C 

Cv fc = A fc v fc , (4) 

one obtains the eigenvalues (k = 1, —,N) and the corresponding eigenvec- 
tors v fc = {v^}- 

A useful null hypothesis is provided by the limiting case of entirely random 
correlations. In this case the density of eigenvalues pcW defined as 

1 rira(A) 

pc{x) = iV^T' (5) 

where n(A) is the number of eigenvalues of C less than A, is known analyti- 
cally [jl5| , and reads 

Q Vi^max — A) (A — Xmin) /„v 

pcW = W X ' (6) 



AS = a 2 (l + l/Qi2^1/g), 

with Xmin < A < X max , Q = T/N > 1, and where a 2 is equal to the variance 
of the time series which in our case equals unity. 
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Figure 2: The probability density (histogram) of the eigenvalues of the corre- 
lation matrix C calculated from the DAX time series of 15 s returns during the 
calendar period November 28, 1997 - September 17, 1999. The null hypothesis 
of purely random correlations formulated in terms of Eq. (o) is indicated by 
the dashed line. 

3 DAX time-correlations 

As mentioned above our related study is based on the DAX recordings with 
the frequency of 15 s during the period between November 28th, 1997 and 
September 17th, 1999. After this last date the DAX was traded significantly 
longer during the trading day. By taking the DAX intraday 15 s variation 
between the trading time 9:03 and 17:10 which corresponds to T = 1948, and 
rejecting several days with incomplete recordings, one then obtains A = 451 
complete and equivalent time series representing different trading days during 
this calendar period. Using this set of data we then construct the 451 x 451 
matrix C. 

One characteristic of interest is the structure of eigenspectrum. The re- 
sulting probability density of eigenvalues, shown in Fig. |2|, displays a very 
interesting structure. There exist two almost degenerate eigenvalues visibly re- 
pelled from the bulk of the spectrum, i.e., well above X ma x (for Q = 1948/451, 
^max ~ 2.19) which indicates that the dynamics develops certain time specific 
repeatable structures in the intraday trading. The bulk of the spectrum, how- 
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ever, agrees remarkably well with the bounds prescribed by purely random 
correlations. This indicates that the statistical neighbouring recordings in our 
time series of 15 s DAX returns share essentially no common information. 

A significance of this result can be evaluated by the following numerical 
experiment. From a Gaussian distribution we draw N = 451 series x n (i) (n = 
1, N) of random numbers of length T = 1948 (i = 1, ...,T) and determine 
the spectrum of the resulting correlation matrix. The result (histogram) versus 
the corresponding theoretical result expressed by the Eq. (|6|) is shown in the 
upper part of Fig. [| As expected, the agreement is unqestionable. In the 
second step, in each previous series we retain only every third number, e.g., 
x n (l), x n (4), ... This omission is compensated by insertion between every two 
remaining original numbers, say x n (i) and x n (i + 3), the two new x n (i + 
l),x n (i + 2) numbers such that they are functionally (here linearly) dependent 
on x n (i) and x n (i + 3). The net result is the same number N of series of the 
same length T as before, thus Q = T/N formally remains unchanged. The 
structure of eigenspectrum of the corresponding correlation matrix, which is 
shown in the lower part of Fig. ||, changes however completely. In fact, it now 
perfectly agrees with the theoretical formula of Eq. (0) but for the three times 
shorter (T/3) series, i.e., it nicely reflects a real information content. From this 
we can conclude that a common information shared by neighbouring events in 
our DAX time series is basically null and that a whole nonrandomness can be 
associated with the two largest eigenvalues. 

It is also interesting to perform an analogous study of the volatility cor- 
relations. This corresponds to replacing G a {ti) in Eq. (||) by \G a (ti)\, for 
instance. The structure of eigenspectrum of the resulting correlation matrix 
is shown in Fig. Surprisingly, even in this case the bulk of the spectrum 
is consistent with purely random correlations. As compared to Fig. ^, one 
can now identify however three outlying eigenvalues and the largest of them 
is repelled significantly higher, as far up as 13.3. 

The structure of eigenspectrum of a matrix is expected to be related ]l6|, 



17 ] to the distribution of its elements. For this reason in Fig. y we show 



the distributions of such elements of C corresponding to the above specified 
procedure for our two cases under consideration. The upper part of this figure 
corresponds to the returns time series and the lower part to the volatility 
time series. In the first case this distribution is symmetric with respect to 
zero, a Gaussian like (dashed line) on the level of small matrix elements, but 
sizably thicker than a Gaussian on the level of large matrix elements, where 
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Figure 3: (a) The probability density (histogram) of eigenvalues of the cor- 
relation matrix C calculated from N = 451 series of length T = 1948 of the 
Gaussian distributed uncorrelated random numbers versus the corresponding 
null hypothesis (dashed line) formulated in terms of Eq. @. (b) The same 
as (a) but here only every third original random number in each series is re- 
tained. The removed pairs of numbers are replaced by the new ones which 
are functionally dependent on the neighbouring two original numbers. The 
dashed line indicates the result of Eq. (0) for N series of length T/3. 
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Figure 4: The same as in Fig. | but now the volatilities are used instead of 
returns. 

a power law with the index of about 5.5 - 5.7 (which however is far beyond 
the Levy stable regime as consistent with the distribution of returns) provides 
a reasonable representation. It is these tails which generate the two largest 
eigenvalues seen in Fig. |2[ The volatility correlation matrix, on the other 
hand, reveals a somewhat different distribution. First of all, the center of this 
distribution is shifted towards positive values and this is responsible for the 
largest eigenvalue. Secondly, this distribution is asymmetric. This originates 
from the fact that the volatility fluctuations are stongly asymmetric relative to 
their average value. The slope on the right hand side cannot be here reliably 
measured in terms of a single power law, but its even smaller value as compared 
to the previous case (of returns) is evident. On the other hand, on the negative 
side the distribution drops down faster than a Gaussian and, therefore, the 
separation between the two remaining large eigenvalues is significantly more 
pronounced than of their returns counterparts (two largest ones) from Fig. 0. 

In quantifying the differences among the eigenvectors it is instructive to 
look at the superposed time series of normalized returns. One possibility 
adopted here reads: 

N 

9X k (U) = si gn( V a)\Va\ 2 9*(U)- (7) 
a=l 
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Figure 5: Distribution of matrix elements C a a i of the N x N (N = 451) 
correlation matrix C calculated from the 15 s frequency DAX variation during 
the intraday trading time 9:03-17:10. a labels the different trading days during 
the calendar period December 28, 1997 - September 17, 1999. The upper (a) 
part corresponds to the time series of returns and the lower (b) part to the 
time series of volatilities. The solid lines in (a) indicate the power law fits to 
the tails of the distribution, while the dashed one represents a Gaussian best 
fit. The numbers in (a) reflect the corresponding scaling indices. 
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Figure 6: The superposed time series of normalized returns calculated accord- 
ing to eq. (0) for k = 1 (a), k = 2 (b) and k = 200 (c). 
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Figure 7: The superposed time series of normalized volatilities calculated ac- 
cording to eq. (g) for k = 1 (a), k = 2 (b) and k = 3 (c). 
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In this definition |f„| 2 is used instead of v\ for the reason of preserving nor- 
malization and the sign of in order not to destroy any possible coherence 
among the original signals. A collection of such superposed time series of re- 
turns for k = 1, 2 and 200 is shown in Fig. || As it is illustrated by an example 
of k = 200, for a statistical value of k such a superposed signal develops no 
coherent structures and g\ k (U) basically does not differ from a simple average. 
The first two differ however significantly and indicate the existence of the very 
pronounced, up to almost ten times of the mean standard deviations of the 
original time series, repeatable structures at the well defined instants of time 
through many days. As it is clearly seen, the two collective signals correspond 
to two disconnected and well determined periods of an enhanced synchronous 
market activity. The first (k = 1) of them corresponds to the period just before 
closing in Frankfurt during the time interval considered here, and the second 
one (k = 2) to the period immediately after 14:30, which reflects the DAX 
response to the North- American financial news release exactly at this time. It 
is also very interesting to see that in the first case (before closing) the coherent 
burst of activity expressed by g\ 1 (t) is oriented to the negative values while 
in the second case (just after 14:30) (g\ 2 (t)) it points predominantly to the 
positive values. Surprisingly, the DAX response to the Wall Street opening at 
15:30 develops no visible synchronous structure in neither of the eigenstates. 
The first three (k = 1,2,3) analogously superposed volatility signals 

JV 

vxdU) = Y, si Mv k a )\v h a \ 2 \g a (t l )\. (8) 

a=l 

are shown in Fig 0. The first of them is associated with the largest eigenvalue of 
the volatility correlation matrix and reflects the magnitude of average volatility 
as a function of time. The next two {k = 2, 3) constitute counterparts of the 
first two superposed return signals. Interestingly, this correspondence holds 
in reversed order, however. 

Another characteristics which carries some information about the stock 
market dynamics is the probability distribution of the eigenvector compo- 
nents u*. Several relevant histograms, either for single eigenstates or for a 
collection of them, are shown in Fig. ||. In both cases, of the returns as well 
as of the volatility correlation matrices, the distributions of eigenvector com- 
ponents from the bulk of the spectrum agree very well with a Gaussian. In 
the transition region, illustrated here by the eigenstates 3-7, some deviations 
can already be observed. Significantly different are the distributions for the 
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Figure 8: Distributions of eigenvector components v\ for k = 1, k = 2 and for 
the two sets k = 3 — 7 and k = 8 — 451 of eigenvectors. The upper (lower) 
part corresponds to the returns (volatility) correlation matrix. 
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outliers. In the case of the returns correlation matrix it is the second (k = 2) 
eigenvector whose distribution deviates more from a Gaussian; its components 
are concentrated more at zero but, at the same time, the tails of the distribu- 
tion are thicker. This indicates that fewer days (a's), but with a larger weight, 
contribute to the signal seen at 14:30 than to the one just after 17:00. The 
k = 1 and k = 2 eigenvector components of the volatility correlation matrix 
are distributed asymmetrically relative to zero, as consistent with the distri- 
bution of the entries of this matrix. In this second case (k = 2), a long tail of 
the negative eigenvector components develops, which makes the corresponding 
superposed volatility v\ 2 (ti) signal negative. 

The above decomposition of the stock market dynamics allows also to shed 
some more light on the issue of the probability distribution of price changes || . 
That the nature of such changes is very complex can be concluded by looking 
for instance at the probability distributions of fluctuations associated with 
different returns 'eigensignals' g\,(ti). Some examples are shown in Fig. ||. 
To quantify such characteristics like Levy stability or nonstability, based on 
this analysis, would definitely be premature for many reasons. One is the 
statistics which is here too poor fi~g[| . What one however can clearly see is 
that the probability density of fluctuations connected with the bulk of the 
spectrum drops down much faster than the ones connected with the more 
collective (k = 1,2) eigenvectors. In the first case a power law index, of the 
order of 5.8-5.9, can even be assigned, similar on both positive and negative 
sides. The k = 1 and k = 2 signals trace however a completely different, much 
thicker tailed, distribution fl9]| . Their extreme events, which can be qualified 
as outliers [20], carry essentially the same features as the ones identified Q| 
in the Dow Jones draw downs on much longer time scales. 



4 Summary 

The present study quantifies several characteristics relevant for understand- 
ing the dynamics of the stock market time evolution. One principal related 
issue is a question of how the market inefficiency manifests itself. For the 
Deutsche Aktienindex our study thus shows that on the time scales of up to 
one trading day there exist two well defined short periods of the spectacularly 
synchronous repeatable bursts of activity during the intraday trading between 
9:03 and 17:10, a phenomenon somewhat in the sprit of an idea of marginally 
efficient markets p2[ . On the other hand it turns out that generically the con- 
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Figure 9: Probability density functions of fluctuations of the superposed re- 
turns as expressed by the Eq. (|?]). Squares correspond to k = 1, triangles to 
k = 2 and the circles to the average of k = 11 — 451. Both, positive (upper 
part) and negative (lower part) sides of those distributions are shown. 
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secutive returns carry essentially no common information even when probed 
with the frequency of 15 s. The fluctuations associated with the so identi- 
fied distinct components are governed by the different laws which reflects an 
extreme complexity of the stock market dynamics. 
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